The Biotechnology Systems Branch of the Scientific and Technical
Information Center (STIC): no errors detected.

Application Serial Number: 09/830,691A

Source: l-KtAAJ 
Date Processed by STIC: f—ff-Lf 
RAW SEQUENCE LISTING                 DATE: 01/06/2005
PATENT APPLICATION: US/09/830,691A   TIME: 14:49:08

Input Set : N:\Crf4\Refhold\09_folder\I830691A.raw
Output Set: N:\CRF4\01062005\I830691A.raw

1 <110> APPLICANT: Choi, Eui-Sung
2       Rhee, Sang-Ki
3       Sohn, Jung-Hoon
4       Park, Soo-Dong
5       Lee, Yoon-Hyoung
6       Lee, Seung-Jae
7       Jang, Jae-Kweon
8       Choi, Seok-Keun
9       Son, Young-Rok
10 <120> TITLE OF INVENTION: VECTOR FOR THE TRANSFORMATION OF PHAFFIA
11       RHODOZYMA AND PROCESS OF TRANSFORMATION THEREBY
12 <130> FILE REFERENCE: 118-12-US-WO
C--> 13 <140> CURRENT APPLICATION NUMBER: US/09/830,691A
14 <141> CURRENT FILING DATE: 2001-04-2$
15 <150> PRIOR APPLICATION NUMBER: KR 1998/46547
16 <151> PRIOR FILING DATE: 1998-10-31
17 <150> PRIOR APPLICATION NUMBER: PCT/KR99/00265
18 <151> PRIOR FILING DATE: 1999-05-29
19 <160> NUMBER OF SEQ ID NOS: 20
20 <170> SOFTWARE: FastSEQ for Windows Version 4.0

22 <210> SEQ ID NO: 1
23 <211> LENGTH: 1223
24 <212> TYPE: DNA
25 <213> ORGANISM: Phaffia rhodozyma
26 <400> SEQUENCE: 1

27 atggtcaacg ttcccaagac tcgacgtgag ttatagcaat ttcaacaact ctccagacga 60
28 caaatattcc agtgcatcga aagagtttgt ggataaacgc gacagtttca agggaaagag 120
29 tcgatggaca gatttggaag acttagccgg tcaaggaact tggggatcac gtggcggagg 180
30 actcatcaga agaagtcggg atttgtttga tcatagtggg atcaagacaa actggaggat 240
31 atggctcgcc ttggaaggga afcctcaggcc tggattcgag gatccgaaag ttgtacgtat 300
32 ggaaaagctt acacggcttg gatttattat ctttcatagg aacctactgc aagggtaagg 360
33 cttgcaagaa gcacacgtaa gtcgcttatc ctctccactc tttcatggca tattgtcaac 420
34 gactggacaa cgcgtccgtt ttgaaacaag tgacttacct gtgaaatttg attctacacc 480
35 tgtatttagc cctcacaagg tacatatcac atcctcccac cccaccctgc ccaacttctt 540
36 cagttcatct tgctctcggt ttccacattc cctgatgacc tccttgtatg ttctttgcga 600
37 acgtttgttt ctgtttctgt aggtgaccca gtacaagaag ggaaaggact ccatcttcgc 660
38 ccagggaaag cgacgatacg accgaaagca gtccggttac ggaggtcaga ccaagcccgt 720
39 tttccacaag aaggctaaga ccaccaagaa ggtcgtcctt cgattggcgg tatttttgtt 780
40 tattttgaat tctttttgtg tatgcagact tttgatgatt atgctcctct gtcgtttttt 840
41 ctcttcaaac agagtgctcc gtctgcagtt cgttcttcct tccaaccaaa acttcaacta 900
42 cagacatcat aaacagacat cttacttcgg tgttctctct ttttttccgc agagtacaag 960
43 atgcagatga ccctcaagcg atgcaagcac ttcgagcttg gaggagacaa gaagaccaag 1020
44 ggttcgtctt ttgtccatat attctctggt tcacttctta tgttcctaac gtacttgttt 1080



45 cctttttggt tcggatgttg tttctatcgg tggtgttttc ttttctttgg atgcattatc 1140
46 atttatcgtg ttggactgtt ttcctctgct cgtttctttc tcctctgtac ttgtgcttct 1200
47 caggagccgc catctctttc taa 1223


49 <210> SEQ ID NO: 2
50 <211> LENGTH: 350
51 <212> TYPE: DNA
52 <213> ORGANISM: Phaffia rhodozyma
53 <220> FEATURE:
54 <221> NAME/KEY: CDS
55 <222> LOCATION: (30)...(347)
56 <400> SEQUENCE: 2

57 cccttcaagt ctcgtctcaa tcagtcaag atg gtc aac gtt ccc aag act cga 53
58                                 Met Val Asn Val Pro Lys Thr Arg
59                                 1               5
60 cga acc tac tgc aag ggt aag gct tgc aag aag cac acc cct cac aag 101
61 Arg Thr Tyr Cys Lys Gly Lys Ala Cys Lys Lys His Thr Pro His Lys
62     10                  15                  20
63 gtg acc cag tac aag aag gga aag gac tcc atc ttc gcc cag gga aag 149
64 Val Thr Gln Tyr Lys Lys Gly Lys Asp Ser Ile Phe Ala Gln Gly Lys
65 25                  30                  35                  40
66 cga cga tac gac cga aag cag tcc ggt tac gga ggt cag acc aag ccc 197
67 Arg Arg Tyr Asp Arg Lys Gln Ser Gly Tyr Gly Gly Gln Thr Lys Pro
68                 45                  50                  55
69 gtt ttc cac aag aag gct aag acc acc aag aag gtc gtc ctt cga ttg 245
70 Val Phe His Lys Lys Ala Lys Thr Thr Lys Lys Val Val Leu Arg Leu
71             60                  65                  70
72 gag tgc tcc gtc tgc aag tac aag atg cag atg acc ctc aag cga tgc 293
73 Glu Cys Ser Val Cys Lys Tyr Lys Met Gln Met Thr Leu Lys Arg Cys
74         75                  80                  85
75 aag cac ttc gag ctt gga gga gac aag aag acc aag gga gcc gcc atc 341
76 Lys His Phe Glu Leu Gly Gly Asp Lys Lys Thr Lys Gly Ala Ala Ile
77     90                  95                  100
78 tct ttc taa 350
79 Ser Phe
80 105
82 <210> SEQ ID NO: 3
83 <211> LENGTH: 106
84 <212> TYPE: PRT
85 <213> ORGANISM: Phaffia rhodozyma
86 <400> SEQUENCE: 3
87 Met Val Asn Val Pro Lys Thr Arg Arg Thr Tyr Cys Lys Gly Lys Ala
88 1               5                   10                  15
89 Cys Lys Lys His Thr Pro His Lys Val Thr Gln Tyr Lys Lys Gly Lys
90             20                  25                  30
91 Asp Ser Ile Phe Ala Gln Gly Lys Arg Arg Tyr Asp Arg Lys Gln Ser
92         35                  40                  45
93 Gly Tyr Gly Gly Gln Thr Lys Pro Val Phe His Lys Lys Ala Lys Thr
94     50                  55                  60
95 Thr Lys Lys Val Val Leu Arg Leu Glu Cys Ser Val Cys Lys Tyr Lys
96 65                  70                  75                  80
97 Met Gln Met Thr Leu Lys Arg Cys Lys His Phe Glu Leu Gly Gly Asp
98                 85                  90                  95
99 Lys Lys Thr Lys Gly Ala Ala Ile Ser Phe
100            100                 105

102 <210> SEQ ID NO: 4
103 <211> LENGTH: 741
104 <212> TYPE: DNA
105 <213> ORGANISM: Phaffia rhodozyma
106 <220> FEATURE:
107 <221> NAME/KEY: misc_feature
108 <222> LOCATION: (0)...(0)
109 <223> OTHER INFORMATION: n=a, t, c, or g
110 <400> SEQUENCE: 4
111 ctcgagtgga cggtggcaat ggcattcgtg tcgttggtgc tcactcgcaa cccaagcagt 60
112 cgcttacccg gggtagcctc cgggtgggcg cgatgatttg tggtgtggat tccttcccta 120
113 tgggtagaac gacgcgcaac caatcattcg gagaaccgct ccgttgtagc cgaccagtct 180
114 gattgatcaa catgccagca cgtcctccgg gacggagact ggcggggatc gtacctcatc 240
115 tggaatcgct ggctcaatgg tagtagtctt cacgatcggc catgagggca gtctaggtgg 300
W--> 116 gttcgcctgc cgaagactgt gtgagtgtgc tganaactaa ttgagtaccg ggggafcaagg 360
117 caaggcgtgt ntggttgcag gtggctgtga gcgagtttgc tgcaaagcga ttcaatgcac 420
118 cccggcttgg ccagcgcgct gcgtcacgaa acacactaaa cggttgacgc cataaagtaa 480
119 taacacactc aagtttgtgg tcccgggtgg gcctctgtgc ctgcgtggga cccgacggga 540
120 gaggaaaacg ttctgtggcc ctctcctctg tggatagtta cctggttgat cctgccagta 600
121 gtcatatgct tgtctcaaag attaagccat gcatgtctaa gtataaacaa attcatactg 660
122 tgaaactgcg aatggctcat taaatcagtt atagtttatt tgatggtacc ttgctacatg 720
123 gataactgtg gtaattctag a 741

125 <210> SEQ ID NO: 5
126 <211> LENGTH: 23
127 <212> TYPE: DNA
128 <213> ORGANISM: Artificial Sequence
129 <220> FEATURE:
130 <223> OTHER INFORMATION: CYH1, a PCR primer for the cloning of L41 genomic
131       DNA fragment
132 <220> FEATURE:
133 <221> NAME/KEY: misc_feature
134 <222> LOCATION: (0)...(0)
135 <223> OTHER INFORMATION: n=a, t, c, or g
136 <400> SEQUENCE: 5
W--> 137 cgcgtagtta aygtnccnaa rac 23

139 <210> SEQ ID NO: 6
140 <211> LENGTH: 25
141 <212> TYPE: DNA
142 <213> ORGANISM: Artificial Sequence
143 <220> FEATURE:
144 <223> OTHER INFORMATION: CYH3, a PCR primer for the cloning of L41 genomic
145       DNA fragment
146 <400> SEQUENCE: 6
147 cccgggtytt ggcyttyttr tgraa 25


149 <210> SEQ ID NO: 7
150 <211> LENGTH: 24
151 <212> TYPE: DNA
152 <213> ORGANISM: Artificial Sequence
153 <220> FEATURE:
154 <223> OTHER INFORMATION: 3' RACE primer
155 <400> SEQUENCE: 7
156 ggtcagacca agcaagtttt tcac 24

158 <210> SEQ ID NO: 8
159 <211> LENGTH: 24
160 <212> TYPE: DNA
161 <213> ORGANISM: Artificial Sequence
162 <220> FEATURE:
163 <223> OTHER INFORMATION: 5' RACE primer
164 <400> SEQUENCE: 8
165 gtgaaaaact tgcttggtct gacc 24

167 <210> SEQ ID NO: 9
168 <211> LENGTH: 24
169 <212> TYPE: DNA
170 <213> ORGANISM: Artificial Sequence
171 <220> FEATURE:
172 <223> OTHER INFORMATION: sense primer for the mutagenesis of L41 gene
173 <400> SEQUENCE: 9
174 ggtcagacca agcaagtttt tcac 24

176 <210> SEQ ID NO: 10
177 <211> LENGTH: 24
178 <212> TYPE: DNA
179 <213> ORGANISM: Artificial Sequence
180 <220> FEATURE:
181 <223> OTHER INFORMATION: antisense primer for the mutagenesis of L41 gene
182 <400> SEQUENCE: 10
183 gtgaaaaact tgcttggtct gacc 24

185 <210> SEQ ID NO: 11
186 <211> LENGTH: 20
187 <212> TYPE: DNA
188 <213> ORGANISM: Artificial Sequence
189 <220> FEATURE:
190 <223> OTHER INFORMATION: a PCR primer corresponding to 18S rDNA
191 <400> SEQUENCE: 11
192 tcctagtaag cgcaagtcat 20

194 <210> SEQ ID NO: 12
195 <211> LENGTH: 20
196 <212> TYPE: DNA
197 <213> ORGANISM: Artificial Sequence
198 <220> FEATURE:
199 <223> OTHER INFORMATION: a PCR primer corresponding to 18S rDNA
200 <400> SEQUENCE: 12
201 ttcggccaag gaaagaaact 20

203 <210> SEQ ID NO: 13
204 <211> LENGTH: 20
205 <212> TYPE: DNA
206 <213> ORGANISM: Artificial Sequence
207 <220> FEATURE:
208 <223> OTHER INFORMATION: a PCR primer corresponding to 28S rDNA
209 <400> SEQUENCE: 13
210 aatcggatta tccggagcta 20

212 <210> SEQ ID NO: 14
213 <211> LENGTH: 20
214 <212> TYPE: DNA
215 <213> ORGANISM: Artificial Sequence
216 <220> FEATURE:
217 <223> OTHER INFORMATION: a PCR primer corresponding to 28S rDNA
218 <400> SEQUENCE: 14
219 gctataacac atccggagat 20

221 <210> SEQ ID NO: 15
222 <211> LENGTH: 2192
223 <212> TYPE: DNA
224 <213> ORGANISM: Phaffia rhodozyma
225 <400> SEQUENCE: 15



226 aagagctatt tgaatgacga ccacaagagt gacgatcata ttgagcatag tataccaaag 60
227 gccaagaggc tgtgtggtgt tctatgagtg gccttgatta tgtgttacat aaataaactg 120
228 atctcaattt ttcaaatact tgccaacact ttcatatatt cacaccaaaa aaagtcagat 180
229 tggcccacaa agtcagatac acgctcgatc gtcgacgggt tcaagcactt tgtcaggcga 240
230 aagaaaggcc acagcaccac ccttcaagtc tcgtctcaat caggttcgtc tagctttttg 300
231 tgtgcaagga tttaccgtct tgatggattt gttcgttgaa agagaggaaa gaacatgctg 360
232 aactgacgaa agtgtgaaca aaaaattgtg attttttcat tgtgtttcgc tggtctcctt 420
233 gctgggttgg gttggatcgg atttatcttc tgtgttggat ggaaaaccct gaatgttctt 480
234 ttcttggaca tcttctaaac tcgacaaaac gattcattcc tccgtactgc tctggttctg 540
235 cctttttgaa tcgcatcgat aaattcttcc ctcggaacgt tcgatcaatc tccgtcaaac 600
236 ttatcatcca aaaatctctt ctcgactgcc gccttgctcc ttttcttcgt tctttcctta 660
237 atccgctttc gactaccctc cttctcttca cactcatagt caagatggtc aacgttccca 720
238 agactcgacg tgagttatag caatttcaac aactctccag acgacaaata ttccagtgca 780
239 tcgaaagagt ttgtggataa acgcgacagt ttcaagggaa agagtcgatg gacagatttg 840
240 gaagacttag ccggtcaagg aacttgggga tcacgtggcg gaggactcat cagaagaagt 900
241 cgggatttgt ttgatcatag tgggatcaag acaaactgga ggatatggct cgccttggaa 960
242 gggaatctcc ggcctggatt cgaggatccg aaagttgtac gtatggaaaa gcttacacgg 1020
243 cttggattta ttatctttca taggaaccta ctgcaagggt aaggcttgca agaagcacac 1080
244 gtaagtcgct tatcctctcc actctttcat ggcatattgt caacgactgg acaacgcgtc 1140
245 cgttttgaaa caagtgactt acctgtgaaa tttgattcta cacctgtatt tagccctcac 1200
246 aaggtacata tcacatcctc ccaccccacc ctgcccaact tcttcagttc atcttgctct 1260
247 cggtttccac attccctgat gacctccttg tatgttcttt gcgaacgttt gtttctgttt 1320
248 ctgtaggtga cccagtacaa gaagggaaag gactccatct tcgcccaggg aaagcgacga 1380
249 tacgaccgaa agcagtccgg ttacggaggt cagaccaagc ccgtttttca caagaaggct 1440
250 aagaccacca agaaggtcgt ccttcgattg ggtacgtttt tgtttatttt gaattctttt 1500
251 tgtgtatgca gacttttgat gattatgctc ctctgtcgtt ttttctcttc aaacagagtg 1560
252 ctccgtctgc agttcgttct tccttccaac caaaacttca actacagaca tcataaacag 1620
253 acatcttact tcggtgttct ctcttttttt ccgcagagta caagatgcag atgaccctca 1680
254 agcgatgcaa gcacttcgag cttggaggag acaagaagac caagggttcg tcttttgtcc 1740
RAW SEQUENCE LISTING ERROR SUMMARY   DATE: 01/06/2005
PATENT APPLICATION: US/09/830,691A   TIME: 14:49:09

Input Set : N:\Crf4\Refhold\09_folder\I830691A.raw
Output Set: N:\CRF4\01062005\I830691A.raw

Please Note:

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220>
to <223> fields of each sequence which presents at least one n or Xaa.

Seq#:4; N Pos. 334,371
Seq#:5; N Pos. 15,18
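
The N positions reported in this summary can be reproduced by counting residues in the printed sequence. The following is a minimal Python sketch, not the STIC checker itself: the function name `n_positions` is hypothetical, and it assumes that spaces and the trailing position counter on each listing line carry no residue information.

```python
def n_positions(listing_text: str) -> list[int]:
    """Return 1-based positions of 'n' residues in a nucleotide string.

    Whitespace and digits (block spacing and the trailing position
    counter printed in the listing) are skipped, so a line can be
    pasted in exactly as it appears.
    """
    residues = [ch for ch in listing_text.lower() if ch.isalpha()]
    return [pos for pos, ch in enumerate(residues, start=1) if ch == "n"]


# SEQ ID NO: 5 (primer CYH1) as printed on listing line 137.
seq5 = "cgcgtagtta aygtnccnaa rac 23"
print(n_positions(seq5))  # [15, 18]
```

For multi-line sequences such as SEQ ID NO: 4, the listing lines would need to be concatenated in order before the call so that positions are counted from the first residue.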
VERIFICATION SUMMARY                 DATE: 01/06/2005
PATENT APPLICATION: US/09/830,691A   TIME: 14:49:09

Input Set : N:\Crf4\Refhold\09_folder\I830691A.raw
Output Set: N:\CRF4\01062005\I830691A.raw

L:13 M:270 C: Current Application Number differs. Wrong Format
L:116 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 after pos.:300
M:341 Repeated in SeqNo=4
L:137 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5 after pos.:0